## **Computer Architecture**

## Lab 10

In this lab, we'll use the Data Cache Simulator tool in MARS to study cache memory.

You should turn in the source file for the program you're asked to write in Step 3. It should be submitted via the tool available on Sakai for this lab. It is due by the start of the next class period.

## **Using the Cache Simulation Tool**

- 1. Close any MIPS programs you are currently using.
- 2. Open the program row-major.asm. This program will traverse a 16 by 16 element integer matrix in row-major order, assigning elements the values 0 through 255 in order. It performs the following algorithm:

```
for (row = 0; row < 16; row++)
  for (col = 0; col < 16; col++)
    data[row][col] = value++;</pre>
```

- 3. Assemble the program.
- 4. From the **Tools** menu, select **Data Cache Simulator**. A new frame will appear in the middle of the screen.



This is a MARS Tool that will simulate the use and performance of cache memory when the underlying MIPS program executes. Notice its three major sections:

- Cache Organization: You can use the combo boxes to specify how the cache will be configured for this run. Feel free to explore the different settings, but the default is fine for now.
- Cache Performance: With each memory access during program execution, the simulator will determine whether or not that access can be satisfied from cache and update the performance display accordingly.
- Tool Control: These buttons perform generic control functions as described by their labels.
- 5. Click the tool's **Connect to MIPS** button. This causes the tool to register as an observer of MIPS memory and thus respond during program execution.
- 6. Back in MARS, adjust the **Run Speed slider** to 30 instructions per second. It is located at the right side of the toolbar. This slows execution so you can watch the Cache Performance animation.



- 7. In MARS, run the program using the **Run** toolbar button , the menu item or keyboard shortcut. Watch the Cache Performance animate as it is updated with every access to MIPS memory.
- 8. What was the final cache hit rate? \_\_\_\_\_\_. With each miss, a block of 4 words are written into the cache. In a row-major traversal, matrix elements are accessed in the same order they are stored in memory. Thus each cache miss is followed by 3 hits as the next 3 elements are found in the same cache block. This is followed by another miss when Direct Mapping maps to the next cache block, and the patterns repeats itself. So 3 of every 4 memory accesses will be resolved in cache.
- 9. Given that explanation, what do you predict the hit rate will be if the block size is increased from 4 words to 8 words? \_\_\_\_\_\_. Decreased from 4 words to 2 words? \_\_\_\_\_\_.
- 10. Verify your predictions by modifying the block size and re-running the program from step 7. *NOTE*: when you modify the Cache Organization, the performance values are automatically reset (you can always use the tool's **Reset** button).

NOTE: You have to **reset** the MIPS program before you can re-run it.

NOTE: Feel free to adjust the **Run Speed slider** to maximum speed anytime you want.

11. Repeat steps 2 through 10 for program **column-major.asm**. This program will traverse a 16 by 16 element integer matrix in column-major order, assigning elements the values 0 through 255 in order. It performs the following algorithm:

```
for (col = 0; col < 16; col++)
  for (row = 0; row < 16; row++)
    data[row][col] = value++;</pre>
```

NOTE: You can leave the Cache Simulator in place, move it out of the way, or close it. It will

|     | not interfere with the actions needed to open, assemble, or run this new program and will remain connected to MIPS memory. If you do not close the tool, then skip steps 4 and 5.                                                                                                                                       |
|-----|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 12. | What was the cache performance for this program? The problem is the memory locations are now accessed not sequentially as before, but each access is 16 words beyond the previous one (circularly). With the settings we've used, no two consecutive memory accesses occur in the same block so every access is a miss. |
| 13. | Change the block size to 16. Note this will reset the tool.                                                                                                                                                                                                                                                             |
| 14. | Create a second instance of the Cache Simulator by once again selecting <b>Data Cache Simulator</b> from the <b>Tools</b> menu. Adjust the two frames so you can view both at the same time. Connect the new tool instance to MIPS, change its block size to 16 and change its number of blocks to 16.                  |
| 15. | Re-run the program. What is the cache performance of the original tool instance?                                                                                                                                                                                                                                        |